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ABSTRACT 


The  CHITRAN  program  computes  the  Chi-square  statistic  for  testing 
the  hypothesis  that  a  random  sample  is  from  a  normal  population.  Also 
given  are  the  degrees  of  freedom  for  the  Chi-square  statistic;  a  bar 
chart  (frequency  distribution),  the  mean,  standard  deviation,  range, 
maximum,  and  the  minimum  of  the  observations  comprising  the  sample. 
Eleven  transformations  on  the  observations  are  possible  before  r.'ne 
calculations  are  performed.  An  optional  feature,  "pooling,"  allows 
the  grouping  of  samples  in  order  to  obtain  a  single  Chi-square  statistic 
for  the  grouped  (pooled)  data. 

C’ilTRAN  is  written  in  FORTRAN  IV  for  the  IBM  7030  (STRETCH) 


computer . 
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INTRODUCTION  AND  SUMMARY 


CHITRAN  (Chi-square  Test  with  Transformations)  is  a  program  to 
statistically  compare  the  agreement  of  the  distribution  of  a  sample  of 
observed  data  with  a  parent  normal  distribution  having  a  mean  and  variance 
equal  to  those  of  the  sample  distribution. 

The  distribution  of  the  observations  in  the  sample  is  classified 
into  a  number  of  intervals  (mutually  exclusive  and  exhaustive)  and  the 
observed  frequency  in  each  interval  is  then  "compared"  with  the  frequency 
that  would  be  expected  in  the  same  interval  if  the  parent  distribution 
were  normal.  Because  the  observed  and  expected  frequencies  seldom  agree, 
the  question  is  whether  the  discrepancies  are  small  enough  to  be  considered 
as  due  to  chance  or  are  so  large  that  the  hypothesis  of  normality  must  be 
rejected.  The  statistic  used  in  CHITRAN  to  make  the  "comparison"  between 
observed  and  expected  frequencies,  i.e.,  to  test  the  hypothesis  of 
normality  is 

m  (f  -CD  )2 

X  =  E  \  1  ZJ 7  with  m-3  degrees  of  freedom, 

where 

fj  is  the  observed  frequency  in  the  jth  interval  ( j=l,2,3, . . . ,m) 

cpj  is  the  expected  frequency  in  the  jt^  interval 
and  m  is  the  number  of  intervals  into  which  the  range  of  the  sample 
distribution  is  divided. 

The  degrees  of  freedom  for  the  Chi-square  statistic  are  calculated 
as  m-3  because  only  the  general  form  of  the  normal  distribution  can  be 
hypothesized,  with  the  mean  and  variance  being  estimated  from  the 
observations  of  the  sample.  Two  degrees  of  freedom  are  subtracted  from  m 


1 


because  of  the  estimation  of  the  two  parameters  and  an  ad&xtional 

degree  of  freedom  is  subtracted  because  the  sum  of  the  expected  frequencies 

is  made  equal  to  the  number  of  observed  frequencies 

The  symbol  x2  is  used  because,  under  the  hypothesis  of  normality, 
this  statistic  is  only  approximately  distributed  as  Chi-square  (x2) >  with 
the  degree  of  approximation  depending  mainly  on  the  size  of  the  expected 
frequencies.  Many  authors  recommend  that  qjj  should  be  at  least  5  for 
each  interval  in  order  that  the  approximation  is  adequate.  (The  minimum 
number  of  expected  observations  in  each  of  the  m  intervals  is  an  input 
parameter  in  CHITRAN  and  adjacent  intervals  are  combined  until  this 
minimum  requirement  is  met.)  Assuming  that  the  expected  frequency  in 
each  interval  is  large  enough  that  is  approximately  distributed  as 
X2,  the  upper  tail  of  the  tabled  x2  distribution  can  be  used  as  a 
critical  region  to  test  the  hypothesis  of  normality.  Examples  of  the 
test  are  given  in  section  V.C. 

The  Chi-square  test  for  normality,  accredited  to  Ku. 1  Pearson 
in  1900  (with  contributions  by  R.  A.  Fisher  in  1924),  is  often  used 
even  if  the  the  sample  is  known  to  be  non-norma] .  For  example,  physical 
considerations  may  restrict  the  sample  observations  to  non-negative  values, 
but  the  normal  distribution  may  give  sufficient  approximation  to  the 
sample  distribution  over  the  range  of  the  sample  variate.  The  possibility 
of  approximating  a  sample  distribution  by  the  normal  distribution  is 
often  investigated  because  of  the  characteristics  of  the  normal  curve, 
namely  a  single  mode  with  the  frequencies  decreasing  symmetrically  on 
both  sides  of  the  mode,  characteristics  which  are  often  found  in 
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physical  phenomena.  If  the  normal  approximation  can  be  considered 
adequate,  there  is  a  considerable  advantage  in  that  the  normal 
distribution  is  relatively  easy  to  handle  mathematically  because  of  the 
extensive  tabulation  of  the  standard  normal  distribution  to  be  found  in 
the  literature. 

The  numerical  value  of  the  Chi-square  statistic  (x2) >  although 
serving  as  a  criterion  to  test  the  hypothesis  of  normality  (or  approximate 
normality)  does  not  indicate  the  way  in  which  the  sample  data  disagrees 
with  the  normal  distribution  if  the  hypothesis  is  rejected.  In  CHITRAM, 
therefore,  a  bar  chart  of  the  observed  frequencies  is  printed,  which 
sometimes  serves  as  a  visual  aid  in  determining  the  actual  form  of  the 
sample  distribution  if  the  hypothesis  of  normality  must  be  rejected. 

The  utility  of  the  bar  chart  as  a  visual  aid  in  case  of  a  non-normal 
distribution  can  be  illustrated  by  the  example  problem  (section  V) . 

Suppose  it  is  not  known  that  the  twc  samples  in  the  example  problem 
are  approximately  log-normally  distributed.  Visual  inspection  of  the 
bar  charts  of  the  two  samples  indicates  that  the  distributions  are 
skewed  such  that  there  is  a  high  concentration  of  small  values  of  the 
variate  and  relatively  few  large  values.  Since  this  type  of  skewress 
is  characteristic  of  the  log-normal  distribution,  the  bar  chart  should 
lead  the  analyst  to  consider  the  possibility  that  the  parent  populations 
are  distributed  log-normally  (or  approximately  log-normally) .  The 
example  problem  also  illustrates  the  aid  of  the  transformations  in 
determining  the  actual  form  of  a  non-normal  distribution.  If,  for 
example,  the  hypothesis  of  normality  must  be  rejected  and  the  bar  chart 
indicates  a  skewness  of  the  type  described  above,  the  Chi-square  test 
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can  then  be  performed  on  the  logarithms  of  the  original  values  in  an 
attempt  to  confirm  the  hypothesis  that  the  parent  distribution  is 
actually  log-normal. 

As  many  as  500  samples  may  be  processed  by  CH1TRAN  with  a 

maximum  of  14,000  observations  in  each  sample.  Up  to  200  "different" 

Chi-square  tests  may  be  performed  on  each  sample  of  data,  where  each 

Chi-square  test  begins  with  a  different  number  of  intervals,  none  of 

which  may  exceed  400.  The  capability  of  performing  more  than  one 

Chi-square  test  on  a  sample  of  data  is  included  in  CHITRAN  because  the 

numerical  value  of  the  Chi-square  statistic  and  its  degrees  of  freedom 

» 

are  dependent  upon  the  number  of  intervals  into  which  the  sample 
distribution  is  divided.  Performing  several  Chi-square  tests,  each 
with  a  different  number  of  intervals,  helps  protect  against  the 
possibility  of  incorrectly  rejecting  or  incorrectly  accepting  the 
hypothesis  of  normality  on  the  basis  of  a  single  Chi-square  test.  For 
example,  if  10  Chi-square  tests  (each  with  a  different  number  of  intervals) 
were  performed  on  the  same  sample  of  data,  and  the  hypothesis  of  normality 
was  found  to  be  acceptable  in  9  of  the  tests  and  rejected  in  1,  the 
multiple  testing  would  minimize  the  effect  of  choosing  by  chance,  in  a 
single  Chi-square  test,  that  number  of  intervals  which  led  to  rejection. 

An  optional  feature,  "pooling,"  is  incorporated  into  CHITRAN  to 
allow  the  grouping  of  samples  in  order  to  obtain  a  single  Chi-square 
statistic  for  the  grouped  (pooled)  data.  This  feature  \s  included  in 
CHITRAN  primarily  for  use  in  analysis  of  variance,  where  normality  of 
the  residuals  in  the  cells  of  the  classification  is  essential  in  the 
construction  of  confidence  limits  and/or  tolerance  limits  and  is  also 
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required  for  tests  of  significance.  CHITRAN  can  perform  the  Chi-square 
test  only  on  the  pooled  sample  or  on  the  pooled  sample  and  the  individual 
samples.  (Naturally,  the  pooled  sample  option  need  not  be  exercised,  in 
which  case  the  Chi-square  test  is  performed  only  on  the  individual 
samples.)  The  upper  limit  for  the  total  number  of  observations  in 
the  pooled  sample  is  1,779,200. 

Discussions  of  the  Chi-square  test  for  normality  and  examples  of 
its  application  can  be  found  in  much  of  the  literature  (see,  for  example. 
Burr  [1953]  or  Wadsworth  and  Bryan  [I960]).  A  comprehensive  discussion 
of  the  test,  including  its  development,  application  and  limitations  can 
be  found  in  Cochran  [1952].  Cochran  also  discusses  alternative  tests 
and  tests  which  are  supplementary  to  the  Chi-square  test. 

IX.  COMPUTATIONAL  PROCEDURE 

The  operations  described  in  this  section  are  performed  for  each 
sample  of  data,  after  the  transformation,  or  transformations,  specified 
on  Card  Type  2  (see  section  III  .A. 2)  have  been  performed  on  the  n  sample 
observations.  If  the  pooling  option  has  been  exercised,  the  pooled 
sample  is  formed  as  follows  before  the  operations  are  performed.  In 
each  of  the  samples  that  are  to  be  pooled,  the  sample  mean  is  subtracted 
from  the  observations  in  the  sample,  i.e.,  the  mean  of  the  sample  is 
subtracted  from  each  A  the  n  observations  of  the  sample.  I  he  mean 
of  each  individual  sample  comprising  the  pooled  sample  is,  therefore, 
translated  to  zero.  The  observations  of  each  sample  are  then  grouped 
into  a  single  pooled  sample  (with  mean  =  0) .  The  computational 
procedure  is  described  in  this  section  for  an  individual  sample  but  is 
also  valid  for  the  pooled  sample  except  for  the  following  variations  . 
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The  minimum  number,  INTOVP,  of  expected  observations  in  each  interval 
of  the  pooled  sample  and  the  number  of  intervals,  INTERP(I),  into  which 
the  pooled  sample  frequency  distribution  is  to  be  divided  need  not  agree 
with  INTOV  and  INTER($,  the  corresponding  input  parameters  for  the 
individual  samples.  In  case  of  a  pooled  sample,  therefore,  the  values 
of  INTOVP  and  INTERP(I)  on  Card  Type  4  must  be  substituted  for  INTOV 
and  INTER(I),  respectively,  in  the  description  of.  the  computational 
procedure.  The  only  other  deviation  in  procedure  for  the  pooled  sample 

is,  as  described  later,  the  calculation  of  the  degrees  of  freedom. 

£  Xj  |  f  (xj -x)a 

The  mean,  — ,  and  the  standard  deviation,  -J-LtJ; - ,  of  the 

n  \  n-1 

n  sample  observations  are  computed  and  printed. 

A  frequency  distribution  of  the  sample  observations  is  then  formed 
in  the  following  way.  The  maximum,  xB#x,  and  the  minimum,  xn , n ,  of  the 
observations  are  determined  and  the  range  of  the  sample  frequency 
distribution  is  computed  as  xoax  -  xBln.  In  order  to  obtain  the  interval 
width,  D,  the  range  is  divided  by  the  input  value  of  the  desired  number 
of  intervals,  INTER(I) ,  specified  on  Card  Type  5.  The  maximum,  minimum, 
range,  and  the  interval  width  are  printed  as  output.  The  upper  bounds  of 

each  of  the  INTER(I)  intervals  are  then  computed  by  adding  ID,  2D,  3D,  - , 

j:D,  - ,  INTER(I)D,  respectively,  to  xB j n .  The  maximum,  xfflax,  is 

thereby  determined  as  the  upper  bound  of  the  last  interval.  The  upper 
bounds  are  then  printed  as  output. 

NOTE:  j'  is  used  to  denote  the  general  interval  of  the  INTER(D 

interval  into  which  the  sample  frequency  distribution  is  divided 
according  to  input  specifications,  j  is  later  used  to  denote  the 
general  interval  of  the  m  intervals  that  result  from  the  restriction 
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that  a  minimum  number  of  expected  observations  must  be  in  each 
interval.  The  j'th  interval  may  or  may  not  coincide  with  the 
j1-*1  interval. 

Each  observation  is  then  assigned  to  its  proper  interval,  i.e.,  to  the 
"first"  interval  which  has  an  upper  bound  that  is  larger  than  the  obser¬ 
vation.  In  other  words,  the  observation  xt  is  assigned  to  the  j'th 
interval  if  x,  is  £  xBln  +  (j'-l)D  but  <  xB , n  +  j'D.  The  number  of 
observations  in  each  interval  is  counted  and  printed  as  output  under 
the  heading  "FREQUENCY."  The  count  in  each  interval  is  printed  to  the 
right  of  the  upper  bound  of  the  interval.  A  bar  chart  is  then  printed 
in  which  each  observation  is  represented  by  an  "X" . 

NOTE:  If  the  number  of  observations  in  any  interval  exceeds  65, 
the  number  of  X's  used  in  the  bar  chart  to  represent  the  obser¬ 
vations  is  scaled  such  that  only  65  X's  are  printed  for  the  interval 
with  the  largest  observed  frequency.  This  scaling  is  performed  in 
order  to  preserve  the  relative  proportions  of  the  bar  chart,  which 
would  otherwise  be  lost  because  of  printing  limitations.  The 
following  method  is  used  when  scaling  is  necessary.  The  maximum 
observed  frequency  is  divided  by  65,  giving  the  number  of  obser¬ 
vations  to  be  represented  by  each  printed  X.  For  example,  if  the 

98 

maximum  frequency  is  98,  each  X  represents  —  =  1.5077  observations 

65 

and  the  statement  "X  REPRESENTS  1.5077  OBSERVATIONS"  is  printed 
as  output.  The  number  of  X's  to  be  printed  in  this  case  for  each 
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interval  is  then  obtained  by  dividing  the  observed  frequency  in 
each  interval  by  1.5077.  The  number  of  X's  printed  is  always 
rounded  off  to  the  nearest  integer. 

After  the  bar  chart  has  been  formed,  the  quantity  -  3  is 

computed,  where  INTOV  (see  Card  Type  5,  columns  6-10)  is  the  minimum 

for  the  expected  number  of  observations  to  be  found  in  each  interval. 

If  -  3  is  ^  0,  the  observed  frequencies  and  the  bar  chart  are 

INToV  ’ 

printed  and  the  statement  "CHISQUARE  COULD  NOT  BE  COMPUTED"  is  printed. 

No  attempt  is  made  in  this  case  to  compute  the  Chi-square  statistic 

because  of  the  joint  consequence  of  (a)  the  restriction  that  cp^  ,  the 

expected  number  of  observations  in  each  interval,  must  be  greater  than 

INTOV  and  (b)  the  definition  of  the  degrees  of  freedom  for  the  Chi-square 

statistic  as  the  number  of  intervals,  m  (for  which  >  INTOV),  minus  3. 

If  the  quantity  — - —  -  3  is  ^  0,  the  degrees  of  freedom  could  not  be 
INTOV 

>  0  and  further  computations  would,  therefore,  be  meaningless. 

If  the  quantity  — Q -  3  is  >  0,  an  attempt  is  made  to  compute 

INTOV 

the  Chi-square  statistic.  The  expected  fre juency  distribution  is  formed. 
This  distribution  gives  the  number  of  observations  that  would  be  expected 
in  each  of  the  INTER(I)  intervals  if  the  sample  of  n  observations  was 
actually  from  a  parent  normal  distribution  having  a  mean  and  standard 
deviation  equal  to  those  of  the  sample  observations.  Under  the  hypothesis 
of  normality,  the  expected  frequency  (cp^*  )  in  each  of  the  INTER(I)  intervals 
is  obtained  by  multiplying  the  total  number  of  observations,  n,  by  the 
probability  that  an  observation  will  be  in  a  given  interval.  The 
probabilities  are  computed  by  a  system  subroutine  which  uses  the 
standardized  normal  distribution  function, 


8 


P(X) 


1 

/2TT 


Consequently,  the  upper  bounds  in  each  interval  must  be  standardized  by 
subtracting  the  sample  mean,  x,  and  dividing  by  the  sample  standard 
deviation,  s.  The  probability  that  an  observation  will  be  in  the  first 
interval  is  then 


dx , 


where  T1  is  the  smallest  standardized  upper  bound.  The  probability  that 
an  observation  will  be  in  the  second  interval  is  obtained  by  integrating 
from  Tx  to  T2  ,  where  T2  is  the  standardized  upper  bound  of  the  second 
interval.  This  procedure  is  continued  until  probabilities  have  been 
computed  for  each  interval. 

Each  time  an  expected  frequency,  cpj«  ,  is  found  for  an  interval 
(cty  =  the  number  of  sample  observations  multiplied  by  the  probability 
that  an  observation  is  in  the  j'th  interval),  the  value  of  qy  is  tested 
to  determine  if  it  is  INTOV .  If  is  ^  INTOV,  the  expected  frequency 
is  added  to  that  of  the  next  interval,  i.e.,  the  intervals  are  "combined." 
The  procedure  is  continued,  if  necessary,  until  a  combined  interval  does 
have  an  expected  frequency  >  INTOV.  The  expected  frequencies  of 
succeeding  intervals  are  also  examined  and,  if  necessary,  combined  with 
the  next  interval  or  intervals.  Each  time  an  interval  (or  group  of 
combined  intervals)  is  found  to  have  an  expected  frequency  >  INTOV,  the 
expected  frequency  for  the  remaining  intervals  is  examined.  If  INTOV 
or  less  expected  frequencies  remain,  the  remaining  intervals  are  combined 
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with  the  preceding  interval.  In  this  way  m  "new"  intervals  result  from 
the  original  INTER(I)  intervals,  each  of  the  m  intervals  having  an 
expected  frequency,  cp  j  ( j  =  l ,  2  ,  .  .  .  ,m)  ,  greater  than  INTOV  . 

A  count  is  then  made  of  the  number  of  sample  observations,  f j , 
that  actually  fall  in  each  of  the  m  intervals  and  the  contribution  of 
each  interval  to  the  Chi-square  statistic  is  computed  and  printed,  the 
contribution  of  the  j*-*1  interval  being 


The  observed  and  expected  number  of  observations  in  each  of  the  m 
intervals  are  then  printed  as  output  under  the  headings  "OBS  FR"  and 
"EXPD  FR,"  respectively. 

The  quantity  m-3,  the  degrees  of  freedom,  is  computed  and  if 
5  0,  the  statement  "CHISQUARE  COULD  NOT  BE  COMPUTED"  is  printed.  (In 
case  of  a  pooled  sample,  the  number  of  degrees  of  freedom  is  computed  as 
m-K-2,  where  K  -  the  number  of  samples  comprising  the  pooled  sample.) 

If  the  number  of  degrees  of  freedom  is  >  0,  the  Chi-square 
statistic  is  computed  by  summing  the  contributions  of  each  of  the  m 
intervals ,  i .e . , 


m 


i  =  l 


The  value  of  \  is  printed  as  output,  along  with  the  degrees  of  freedom. 
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III.  INPUT  PREPARATION 


A .  Problem  Deck  Setup 

The  problem  deck  is  listed  below  by  card  type.  More  than 
one  card  of  a  given  type  may  be  necessary  for  a  specific  problem.  Two 
sets  of  Card  Type  1  to  7  may  be  processed  at  one  Lime,  i.e.,  another 
problem  deck  may  be  stacked  behind  the  first  one. 

CARD  TYPE  1  -  VARIABLE  FORMAT  CARD 
CARD  TYPE  2  -  INPUT  CONTROL  CARD 

CARD  TYPE  3  -  POOLED  SAMPLE  IDENTIFICATION  CARD  (OPTIONAL)* 

CARD  TYPE  4  -  POOLED  SAMPLE  CONTROL  CARD  (OPTIONAL)* 

CARD  TYPE  5  -  SAMPLE  CONTROL  CARD 

CARD  TYPE  6  -  SAMPLE  IDENTIF7 1ATION  CARD** 

CARD  TYPE  7  -  SAMPLE  DATA  CARD** 


*  Omit  if  samples  are  not  to  be  pooled. 

**  This  "pair"  of  Card  Types  (6  and  7)  comprises  the  data  input  for  one 
sample.  (More  than  one  Card  Type  7  may  be  necessary.)  These  cards  may 
be  input  on  punched  cards  or  on  tape  as  indicated  in  column  10  of  the 
INPUT  CONTROL  CARD  (Card  Type  2)  . 

1 .  CARD  TYPE  1  -  VARIABLE  FORMAT  CARD 
Column  Format  Program  Variable  Explanation 

1-80  iOA8  F0RM  The  format  specifications  by  which 

each  Card  Type  7  (or  data  input  record, 
if  data  is  on  tape)  is  to  be  read. 

The  format  specifications  must  be 
enclosed  by  parentheses. 
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2.  CARD  TYPE  2  -  INPUT  CONTROL  CARD 


Column  Format  Program  Variable  Explanation 


1-5  15  N0SAM 


10  II  TAPE 


15  II  GR0UP 


19-20  12  N0TRAN 


The  number  of  samples  for  which  the 
Chi-square  test  is  to  be  performed. 
N0SAM  <:  500. 

2  -  The  information  on  Card  Types  6 
and  7  is  on  punched  cards. 

5  -  The  information  on  Card  Types  6 
and  7  is  on  magnetic  tape. 

0  -  Do  not  pool  the  samples,  i.e., 
perform  the  Chi-square  test(s)  only 
on  the  individual  sample(s) . 

1  -  Perform  the  Chi-square  tests  on 
the  individual  samples  and  on  the 
pooled  sample. 

2  -  Perform  the  Chi-square  test  on 
the  pooled  sample  only.  Print  the 
observations  comprising  the  pooled 
sample . 

3  -  Perform  the  Chi-square  test  on  the 
pooled  sample  only.  Do  not  print  the 
observations . 

The  number  of  transformations  which 
are  to  be  performed  on  all  observations 
comprising  each  of  the  samples.  The 
Chi-square  test  is  performed  on  each 
transformed  set  of  data.  If  more  than 
one  transformation  is  desired,  the 
input  data  must  be  on  tape  (i.e.,  if 
N0TRAN  >  1,  then  TAPE  =  5) .  X  to  X 
is  considered  to  be  a  transformation, 
therefore,  N0TRAN  =  1  if  the  Chi-square 
test  is  to  be  performed  on  the  original 
observations  only.  1  s  N0TRAN  11. 
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Card  Type  2  (Cont'd) 

Column  Format  Program  Variable  Explanation 


21-25  15  IT(  1) 


26-30  15  IT (2) 


71-75  15  IT (11) 

Transformations: 


An  integer  designating  the  first 
transformation  to  be  performed  on 
the  observations  .  See  the  list  of 
11  available  transforma  .ions  below 
and  the  corresponding  integer  to  be 
entered . 

There  are  N0TRAN  of  these  entries. 

If  N0TRAN  =  2,  e.g.,  the  first 
transformation  is  specified  by  an 
integer  in  columns  21  to  25  and  the 
second  by  an  integer  in  columns  26 
to  30.  The  remaining  columns  are 
left  blank.  If  only  transformation 
number  6  ( the  square  root)  is  desired, 
for  example,  a  six  is  entered  in 
column  25  and  the  remaining  columns 
are  left  blank. 


The  following  integers  are  used  to  designate  the  available 
transformations: 


1 .  X  ♦-  X. 

2.  X  ♦-  In  X. 

3.  X  «-  ln[  ln(X)  ]  . 

4.  X  <-  ln(  1+X)  . 

5.  X  «-  [  l+ln(  1+X)  ]  . 

6.  X  ♦-  /X .  This  transformation  is  identified  as  SQRT  X  in  the 

printout 

7.  X 

8.  X  ♦-  1+-. 

X 

9.  X  sin-1X.  This  transformation  is  identified  as  ARCS  IN  (X)  . 

10.  X  2  sin'Vx.  Identified  as  2*  ARCSIN(SQRT(X) )  . 

11.  X  4-  sin_1/x.  Identified  as  ARCS  IN  (SQRT  (X)  )  . 
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column 

Column 

1-80 


Column 

1-5 

6-10 


11-15 


16-20 

21-25 


76-80 


3.  CARD  TYPE  3  -  POOLED  SAMPLE  IDENTIFICATION  CARD  (OPTIONAL) 
This  card  is  omitted  if  the  samples  are  not  pooled,  i.e.,  if 
15  of  Card  Type  2=0. 

Format  Program  Variable  Explanation 

1QA8  P00LID  Eighty  columns  to  be  used  for  the 

identification  of  the  pooled  sample. 


4.  CARD  TYPE  4  -  POOLED  SAMPLE  CONTROL  CARD  (OPTIONAL) 
This  card  is  omitted  if  the  samples  are  not  pooled. 


Format  Program  Variable  Explanation 


5X  - 

15  INT0VP 

15  N0INTP 

15  INTERP(l)  > 

15  INTERP(2) 

15  INTERP(l) 

15  INTERP( 13) 


These  columns  are  not  read  by  the 
program  and  may,  therefore,  be  used 
for  additional  identification  of  the 
problem . 

The  minimum  for  the  theoretical 
(expected)  number  of  observations 
which  must  be  in  each  of  the  m 
intervals  used  in  calculating  the 
Chi-square  statistic  for  the  pooled 
sample.  5  is  recommended  as  a  lower 
limit  for  this  number. 

The  number  of  Chi-square  tests  to  be 
performed  on  the  pooled  sample,  where 
each  Chi-square  test  begins  with  a 
different  number  of  intervals,  i.e., 
a  different  number,  INTERP(I) .  (See 
below.)  N0INTP  ^  100. 

The  number  of  intervals  into  which 
the  range  of  pooled  observations  is 
to  be  divided  for  the  Chi-square  tests. 
There  are  N0INTP  of  these  entries. 

For  example,  if  three  Chi-square  tests 
are  desired  on  the  pooled  sample  (i.e., 
N0INTP  =  3  in  column  15) ,  with  the  range 
to  be  divided  into  40,  50  and  60  intervals, 
respectively,  then  40  must  be  entered 
in  columns  19-20,  50  in  columns  24-25 
and  60  in  columns  29-30.  The  value 
of  INTERP(I)  must  be  <  400. 


L4 


If  N0INTP  >  13  but  <  30,  continue  on  an  additional  Card  Type 
4  in  the  following  manner: 

1-5  15  INTERP( 14) 

76-80  15  INTERP(29) 

If  N0INTP  >  29,  continue  on  additional  card(s)  in  the 
same  format  as  the  second  Card  Type  4. 

5.  CARD  TYPE  5  -  SAMPLE  CONTROL  CARD 

There  are  N0SAM  (see  columns  1-5,  Card  Type  2)  of  these 
sample  control  cards,  one  for  each  sample  of  input  data.  If  the  number 
of  samples  is  greater  than  1,  the  sample  control  cards  are  stacked 
consecutively  and  must  be  in  the  same  order  in  which  the  samples  are 
input  on  Card  Types  6  and  7.  If  GR0UP  =  2  or  3  (column  15,  Card  Type 
2),  i.e.,  if  the  Chi-square  test  is  to  be  performed  only  on  the  pooled 
sample,  then  only  N0BS  (columns  1-5)  and  N0INT  =  1  (column  15)  must  be 
entered  on  Card  Type  5. 

Column  Format  Program  Variable  Explanation 

1-5  15  N0BS  The  number  of  observations  comprising 

the  kth  sample  (assuming  this  is  the 
kth  sample  control  card).  N0BS  £  14,000. 

6-10  15  INT0V  The  minimum  for  the  theoretical  (expected) 

number  of  observations  which  must  be  in 
each  of  the  m  intervals  used  in  calculating 
the  Chi-square  statistic  for  the  k*1*1 
sample.  5  is  recommended  as  a  lower 
limit  for  this  number. 


15 


Card  Type  5  (Cont'd) 

Explanation 

The  number  of  Chi-square  tests  to  be 
performed  on  the  kth  sample,  where 
each  Chi-square  test  begins  with  a 
different  number  of  intervals,  i.e., 
a  different  number,  INTER(I)  .  (See 
below.)  N0INT  £  200. 

The  number  of  intervals  into  which  the 
range  of  the  observations  of  the 
sample  is  to  be  divided  for  the  Chi- 
square  tests.  There  are  N0INT  of 
these  entries. 

For  example,  if  two  Chi-square  tests 
are  desired  on  the  k-h  sample  (i.e., 
N0INT  =  2  in  column  15) ,  with  the  range 
to  be  divided  into  10  and  15  intervals, 
respectively,  then  10  must  be  entered 
in  columns  19-20  and  15  in  columns 
24-25.  The  value  of  INTER(I)  must  be 
£  400. 

If  N0INT  >  13  but  <  30,  continue  on  an  additional  Card  Type 
5  in  the  following  manner: 

1-5  15  INTER  ( 14) 


•  »  • 

76-80  13  INTER  (29) 

If  N0INT  >  29,  continue  on  additional  card(s)  in  the  same 
format  as  described  above  for  the  second  Card  Type  5. 


6.  CARD  TYPE  6  -  SAMPLE  IDENTIFICATION  CARD 
The  data  input  for  each  sample  consists  of  a  "pair'5  of  card 
types,  the  sample  identification  card  (Card  Type  6)  and  the  sample  data 
card  or  cards  (Card  Type  7)  .  If  there  is  more  than  one  sample,  the  pair 


Column  Format  Program  Variable 


11-15 

15 

N0INT 

16-20 

15 

INTER(l) 

21-25 

• 

15 

• 

INTER(2) 

• 

• 

15 

INTER  (I) 

■ 

76-80 

• 

• 

15 

INTER( 13) 
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of  card  types  for  sample  number  2  follows  immediately  after  those  for 
sample  number  1,  additional  pairs  following  for  each  of  the  remaining 
samples  (if  any) .  The  pairs  of  card  types  must  be  in  the  same  order 
as  their  corresponding  sample  control  cards  (Card  Type  5) , 

Column  Format  Program  Variable  Explanation 

1-80  1QA8  EDENT  Eight/  columns  to  be  used  for  the 

identification  of  the  observations 
comprising  the  kth  sample.  The 
observations  follow  on  Card  Type  7. 


7.  CARD  TYPE  7  -  SAMPLE  DATA.  CARD 
Column  Format  Program  Variable  Explanation 

1-80  Variable  0BSERV  The  numerical  values  of  the 

observations  comprising  the  k1-*1 
sample.  These  values  must  be  input 
in  accordance  with  the  format 
specified  on  Card  Type  1  (Variable 
Format  Card) .  If  more  than  one 
Card  Type  7  is  necessary  for  the  k1-*1 
sample,  the  data  is  continued  on 
succeeding  cards . 
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B .  Job  Request  Sheets 


A  1401  request  sheet  is  necessary  in  order  to  'ave  Card 
Types  6  and  7  put  on  tape.  After  the  data  has  been  put  on  tape,  the 
proper  tape  number  must  be  entered  on  the  7030  job  request  sheet  as 
shown  below.  If  Card  Types  6  and  7  are  not  on  tape  but  are  punched 
cards  in  the  problem  deck,  a  scratch  tape  instead  of  a  specific  tape 
should  be  called  for  on  the  7030  job  request  sheet. 
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70S0  JOB  REQUEST  prnc.nwl- «i?3o/?9  I 


The  number  of  the  tape  containing  the  data  on  Card  Types 
6  and  7  must  be  punched  in  the  IOD  deck  (for  example,  4156  is  punched 
in  place  of  XXXX  on  the  "REEL,  PULXXXX"  card  of  the  IOD  deck  if  Card 
Types  6  and  7  are  on  tape  4156) . 
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CHISQUARE  =  Z  J~  «  WITH  m-3  DEGREES  OF  FREEDOM 


V. 


EXAMPLE  PROBLEM 


A .  Description  of  Example  Problem 

The  example  problem  consists  of  two  samples,  one  with  150 
observations  (sample  No.  1)  and  the  other  with  125  observations 
(sample  No.  2).  Both  samples  are  approximately  log-normally  distributed 
(The  observations  of  sample  No.  1  are  the  antilogs  of  150  numbers 
generated  from  an  approximately  normal  distribution  with  mean  =  2.50 
and  standard  deviation  =  0.30.  The  observations  of  sample  No.  2  are 
the  antilogs  of  125  numbers  generated  from  an  approximately  normal 
distribution  with  mean  =  2.20  and  standard  deviation  =  0.30.)  Samples 
No.  1  and  2  should,  therefore,  appear  to  be  from  non-normal  parent 
populations,  but  the  logarithmically  transformed  observations  of  both 
samples  should  appear  to  have  been  sampled  from  normal  parent  populations. 

In  the  example  problem,  the  Chi-square  test  was  performed  on 
the  original  values  (X  to  X  transformation)  and  on  the  logarithms  (X  to 
In  X  transformation)  for  both  sample  No.  1  and  sample  No.  2. 

Two  sets  of  intervals  (20  and  25)  were  chosen  for  sample  No. 

1,  i.e.,  two  Chi-square  tests  were  performed  on  the  original  values  of 
sample  No.  1  and  two  on  the  logarithms  of  the  original  values. 

Only  one  set  of  intervals  (15)  was  chosen  for  sample  No.  2, 
amounting  to  two  Chi-square  tests,  one  on  the  original  values  of  sample 
No.  2  and  one  on  the  logarithms. 

The  option  for  pooling  the  two  samples  was  exercised,  with 
only  one  set  of  intervals  (40)  being  chosen  for  the  pooled  sample.  Two 
Chi-square  tests  were  performed,  therefore,  on  the  pooled  sample,  one  on 
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the  pooled  original  observations  and  one  on  the  pooled  logarithms  of 
the  original  observations.  Therefore,  a  total  of  8  Chi-square  tests 
were  performed  for  the  example  problem.  In  each  of  the  8  tests  the 
minimum  number  of  expected  observations  in  each  interval  was  specified 
as  5 . 

This  example  problem  was  fabricated  merely  to  illustrate 
the  various  features  of  the  CHITRAN  program,  especially  the  transformation 
option,  and  is  not  intended  as  a  guide  to  the  choice  of  input  parameters 
for  the  general  problem. 

B .  Listing  of  Input  Cards  for  the  Example  Problem 

A  listing  of  the  input  for  the  previously  described  example 
problem  is  given  on  the  following  two  pages.  Because  two  transformations 
(X  to  X  and  X  to  In  X)  were  specified,  it  was  necessary  to  put  Card 
Types  6  and  7  on  tape . 
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C .  Test  Results  and  Example  Output 

Test  Results 

The  output  from  the  example  problem  is  given  here  in  order 
to  exhibit  a  sample  of  the  program  output  and  also  to  illustrate  the 
the  use  of  the  x2  statistic  in  testing  the  hypothesis  of  normality. 

The  first  Chi-square  test  performed  in  the  example  problem  is 
performed  on  sample  No.  1  (with  150  observations).  The  sample  frequency 

distribution  is  divided  into  20  intervals,  i.e.,  INTER ( 1)  =  20  on  the 

first  Card  Type  5  of  the  input  deck.  The  "transformation"  used  is 
X  to  X.  The  computed  x2  statistic  for  this  first  test  is  30.64  with 
12  degrees  of  freedom.  Because  y2  (12  degrees  of  freedom)  is  approximately 
distributed  as  x2  (12  degrees  of  freedom),  the  upper  tail  of  the  tabled 
X2  distribution  can  be  used  as  a  critical  (rejection)  region  for  testing 
the  hypothesis  of  normality.  For  any  given  significance  level,  a,  where 
a  is  the  probability  of  rejecting  the  hypothesis  of  normality  when  the 
hypothesis  is  true,  the  lower  bound  of  the  critical  region  is  defined 

as  that  value  of  (12  degrees  of  freedom)  for  which  the  probability  of 

X  (12  degrees  of  freedom)  being  greater  than  or  equal  to  x2  (12  degrees 
of  freedom)  is  equal  to  o.  At  the  a  =  0.01  level  of  significance,  for 
example,  the  tabled  value  of  x2  (12  degrees  of  freedom)  is  26.22.  Testing 
at  the  170  level  of  significance,  therefore,  the  critical  region  is 
v2  >  26.22.  Since  the  computed  value  of  x2  (=  30.64)  is  within  the 
critical  region,  the  hypothesis  of  normality  must  be  rejected  at  this 
level  of  signif irance .  (Rejection  of  normality  is  to  be  expected  since 
this  sample  of  data  is,  as  mentioned,  approximately  log  normally  distributed.) 


26 


As  a  second  example  of  the  application  of  the  y2  statistic 
in  testing  the  hypothesis  of  normality  and  as  an  illustration  of  the 
effect  of  the  logarithmic  transformation,  consider  the  fifth  Chi-square 
test  printed  in  the  output.  This  Chi-square  test  uses  the  same  input 
parameters  as  the  first  test,  but  the  fifth  test  is  performed  on  the 
logarithms  of  sample  No.  1,  not  on  the  original  values.  The  fifth  test 
resulted  in  a  y2  value  of  15.79  with  12  degrees  of  freedom.  Testing  at 
the  y  -  .01  significance  level,  the  critical  region  is,  again,  y2  >  26.22. 
Since  y2  is  not  i-n  the  critical  region,  the  hypothesis  of  normality 
cannot  be  rejected,  for  this  sample,  at  the  .01  level  of  significance. 
(Again,  this  conclusion  is  to  be  expected  since  the  logarithms  of  the 
observations  of  an  approximately  log  normal  distribution  should  be 
approximately  normally  distributed.)  It  should  be  mentioned  that 
although  the  number  of  degrees  of  freedom  (12)  calculated  for  the.  fifth 
test  was  the  same  as  that  calculated  for  the  first  test,  the  degrees  of 
freedom  of  y2  for  the  transformed  observations  of  a  sample  do  not  always 
agree  with  the  degrees  of  freedom  for  the  original  observations. 

Ttie  other  six  y2  statistics  calculated  for  the  example  problem 
(and  given  in  the  example  output)  may  be  tested  in  a  similar  manner  against 
tabled  yp  values.  The  tabled  y2  distribution  is  not  included  in  this 
report  since  it  can  be  found  in  many  elementary  statistical  texts. 
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Example  Output 
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VI.  FLOW  CHARTS 


START 


REWIND  DISK  10 


1 

READ  VARIABLJ 

t _ 

S  F0RMAT  CARD 

> 

READ  INPUT  C0NTR0L  CARD 

IPRINT=1 

GR0UP=GR0UP+1 
t 


1 


GR0UP=4 

YES 

IPRINT=2 

? 

GR0UP=3 

N0 


SAVE=GR0UP 


GR0UP=1 

* 


N0 


YES 


READ  IDENTIFICATI0N  F0R  P00LED 
SAMPLE 

.  . ~l 


READ  THE  P00L  C0NTR0L  CARD 


1=0 


READ  A  SAMPLE  C0NTR0L  CARD  k 


I 


1=1+1 


WRITE  THE  SAMPLE  C0NTR0L 

I=N0SAM 

N0 

CARD  0N  DISK  10 

W 

? 

- - 

YES 


1=0 


YES 


I=N0TRAN 


N0 


Ah 


1+1 


4 - J 

1  I 

CALL  SIT5R0UT INE  CHI  WITH 

_ ± 

IG0  IS  SET  =  T0  THE  Ith 
TRANS F0RMATI0N  NUMBER  GIVEN 
0N  THE  INPUT  C0NTR0L  CARD. 


REWIND  TAPE 


TAPE 

* 


THE  IG0th  TRANS F0RMATI0N 
FUNCTI0N  AS  ITS  ARGUMENT 


GR0UP=SAVE 
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START 


SET  UP  THE  TDENT  F0RMAT  T0 
SPECIFY  THE  TRANSFORMATION 
WHICH  IS  BEING  HANDLED 


REWIND  DISK 
REWIND  DISK 


10 

12 


JJX-JJX+1  k 


JJX-0 


COMPUTE  STANDARD  DEVIATION 

YES 

GROUP-4 

FOR  THE  POOLED  SAMPLE 

? 

- 1 - 

NO 


IGG=  1 
SUM2P=0 .0 
FN0BSP-O.O 
IDFF=3 
N0G-N0SAM 


READ  SAMPLE  CONTROL  CARD 
VALUES  FOR  THE  JJXth 
SAMPLE  FROM  DISK  10 


NUINT-N0INT 

READ  SAMPLE  IDENTIFICATION 

purine  _Mrtnc 

ANn  SAMPLE  VAI.TIES  FROM  THE 

SUM-0.0 

INPUT  SOURCE  DEFINED  BY 

SUM2-0 .0 

TAPE 

TRANSFORM  THE  OBSERVATIONS  OF 
THIS  SAMPLE  AND  SUM  THE 
TRANSFORMED  VALUES  AS  THE 
VALUE  SUM 


FURTHER  REFERENCE  T0  OBSERVATION 
WELL  REFER  TO  TRANSFORMED 
OBSERVATION 


COMPUTE  MEAN  =  THE  MEAN  OF  THE 

COMPUTE  THE  STANDARD  DEVIATION  OF 

OBSERVATIONS  OF  THIS  SAMPLE 

THE  OBSERVATIONS  OF  THIS  SAMPLE 

VALUE  OF  GROUP 


VALUE  OF  GROUP  ♦- 


STOP 


M  STOP 


UPDATE  VALUES  NEEDED 

STORE  THE  OBSERVATIONS 

TO  COMPUTE  THE  RANGE,  THE 

OF  THIS  SAMPLE,  WITH 

NUMBER  OF  OBSERVATIONS,  AND 

MEAN  SUBTRACTED  FROM 

4— 

THE  STANDARD  DEVIATION  OF 

EACH  OBSERVATION,  ON 
DISK  12 

THE  OBSERVATIONS  OF  THE 
POOLED  SAMPLE 
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COMPUTE  THE  RANGE  F0R 
THIS  (P0SSIBLY  P00LED) 
SAMPLE 


M  1=0 


N0 


I=J+1 


I 


JJX-NDG 

? 


INTERV  IS  SET=INTER(  I)  IF  AN 
INDIVIDUAL  SAMPLE  IS  BEING 
PROCESSED ;  EQUAL  T0  INTERP(I) 
IF  THE  POOLED  SAMPLE  IS  BEING 
PROCESSED 


DIVIDE  THE  RANGE  INT0  THE 
NUMBER  0F  INTERVALS  SPECIFIED| 
BY  INTERV.  ST0RE  THE  UPPER 
BOUNDS  0F  EACH  INTERVAL  INT0 
THE  STEP  ARRAY 


YES 


VALUE  0F  GR0UP 


1 


VALUE  0F  GR0UP 


RETURN 

T0 

CALLING 

PROGRAM 


12 


SUBTRACT  MEAN  FR0M  EACH  0F 
THE  BOUNDS  IN  THE  STEP 
ARRAY  AND  ST0RE  THE  RESULT 
INTO  THE  STIP  ARRAY 


I 


-- STgP  | 


COMPUTE  F REQUENCY  DISTRIBUTION 
FOR  THIS  SAMPLE  CORRESPONDING 
TO  TIE  INTERVALS  DEFINED  BY 
THE  STEP  ARRAY 


PRINT  HEADING  AND  IDENTIFICATION 
FOR  THIS  X' TEST 


IGG r:  2 


GR0UP=4 

IDFF=N0SAM+2 

N0G=1 

NUINT=N0INTP 
INT0V = INTOV P 


SET  UP  CONTROLS  F0R 
THE  POOLED  SAMPLE 


2  I 


(  „  ) 
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43 
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44 
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IS  THE  THEORETICAL 

number  Of  observations 

IN  THE  J,  J+L,  . . . ,  INTERV 
INTERVALS  >  INTOV? 


FO  IS  SET=TO  ITS  CURRENT 
VALUE  PLUS  THE  NUMBER  OF 
REMAINING  OBSERVATIONS , 
i.e.,  PLUS  (03-F0T) 


KOUNT  COUNTS  THE 
NUMBER  OF  CONTIG¬ 
UOUS  SETS  OF  INTER¬ 
VALS  HAVING  MORE 
THAN  INTOV  THEO¬ 
RETICAL  OBSERVA¬ 
TIONS  . 


FO  IS  SET  -  0  TO 
SUM  THE  OBSERVED 
FREQUENCIES  FOR 
THE  NEXT  SET  OF 
INTERVALS 


PROBO  IS  SET  EQUAL  TO  PROBN  TO 
DEFINE  THE  LOWER  BOUND  PROBABILITY 
USED  IN  COMPUTING  THE  THEORETICAL 
NUMBER  OF  OBSERVATIONS  FOR  THE 
NEXT  SET  OF  INTERVALS 


YES  1  J  J  =  INTERV? 


SET  CHI(J)=-I .0  FOR 
J=JJ,  JJ+1 . INTMI 


1  INTERV 

YES 

DEGREES  OF  FREEDOM  (IDF) 

? 

KOUNT -IDFF 

RETURN 

TO 

CALLING 

PROGRAM 


VII.  PROGRAM  LISTING 


T  SUBTYPE. F!OD 


B 

2D 

IOD, ^READER 

B 

3D 

!  OD , *PR I NTER 

B 

5D 

ICO, TAPE,  ,  ,  .EVEN.  ,cavE 

B 

reel .pul 

B 

1  2B 

IOD, D I SK , , ,2899 

B 

1  OB 

I  OD , D I SK ...  1 0 1 

END 

T 

SUBTYPE.F0RTRAN.LMAP.PB1N 

r  chjtran-Chi  square  tfct  with 

COMMON  MAx,M[N,OBSERV(140rV>i NOB  S  , STiPlAOOIt  I  frf  q ( a  0 1  )  , SOE  V  «  1 DF 
COMMON  CH5 ( 4  00  )  iCh] SUv  <  ! OBp  R (400)  , CMPFP ( 40C  > 

COMMON  [NTOV.ENOBF,  INTeP(lEO),F3QAPH(65)»pTEP(40C)  .TAPE,  IGC 
COMMON  NO JOBS. FORM  (  10)  .POOL  !D(  10  )  .  08SERP  (  14  000  )  .  IN  TERR (  100) 

COMMON  NOBS A ( 500  )  ,  I N  TO VD . NO  I  NT  P , CROUP .  1 NTERV .  IPRINT.O 
D1 MENS  ION  I  T (  1 2  ) 

EQUIVALENCE ( NOS AM , NO JOBS ) 

INTEGER  TAPE .GROUP. SAVE .Q 

EXTERNAL  T 1 ,T2.T3,T4,T5»T6»T7,T8.T9,T10«T11 
99  REW I ND  10 

C  READ  VARIABLE  FORMAT  CARD.THIF  FCRVAT  IS  USED  TO  READ  THE  OBSERVATIONS 
READ  400, FORM 
4  00  FORMAT  (  1  0 A8  ) 

0  =  0 

READ  lO^.NOSAM  , T APE .GROUP .notran «  (  I T { I  )  , j = I  , NOTRAN ) 

I  ^0  FORMAT (  1 6  I B  ) 

IF (GROUP. GT . 3 ) STOP 
1 PQINT= t 
GROUP--GROUP+  1 

IF ( GROUP. GT. 1  . ANO.NOSAM.EQ.  1  ) GROUP =1 
IF (GR0UP.LE.3  )G0  TO  96 
I PRINT=2 
GROUP  =  3 
R8  SAvEiGROUP 

IF (GROUP. FO. 1 )GC  TO  201 
O  OFAD  ID  F'.n  THF  pooled  DATA. 

QF tr  /. fi n  ,  POOL  I  D 
C  READ  POOL  NTROL  CARD. 

READ  1  •  1  ,  INTOVP.NOINTF,  {  I NTFRP ( L )  «L  *  1  .NOINTP) 

II  FORMAT  (Hx,  IBIS/ ((  ))  ) 

1  DO  2~0  1=1,  NO  JOBS 

o  Pc AD  CONTROL  CAQDF  FOR  EACH  camplE. 

READ  100.  NOBS  .INT0V.N01NT,  (  JMFR(L)  .  L  =  1  .NOINT  ) 

NOB  S  A  (  I  ) =  NOBS 

=  WR I TF  (  i  D  ) NOBS  .  I NTOV .NO  I  NT ,  (  I NTER  (<),<= 1  ,  NO  I  NT  ) 

00  V"  1  =  1  .  NOTRAN 
I  GO  = I T (  |  ) 

GROUP -SAVF 

GO  TO  I  1  «"',"1«4,F,H,T,PfO,]'',;  ]  )  ,  I  GO 

1  CALL  " H I  ( T 1  ) 

GO  TC  2p'1 

2  CALL  Chi ( T?  ) 

GO  TO 

1  GALL  Cm  I  (  T  3  ) 

GO  to  ?c,f 
4  fall  FH  |  (  T4  1 
C-0  T ~  c 
K  FALL  M  I  (  T  H  ) 

"  0  T  ,1  C 

<•  (ail  -  -  !  (  T  h  ) 


4e 


GO  TO  250 
CALL  CH I  ( T7  ) 

GO  TO  250 
CALL  CHI  ( TO  ) 

GO  TO  2^0 
CALL  chi (TO) 

GO  T  )  250 
CALL  CHI  <  T 1 0 ) 

GO  TO  250 
CALL  CH I  ( T  1  1  ) 

I F ( T APE -2  )  25  1  ,300*251 
REWIND  TAPE 

CONT I NUE 
GO  >0  90 

END 

Subtype .fortran, lmap.pb  in 

SUBROUT  I NE  CH I  ( TFUNC 1 

COMMON  MAX  .  M  I  N,  OBSERV  (  MOOO  )  ,  NOBS.ST  I  P  (  400  )  *  I  FREQ  (  A  0  l  )  ,SDEV,  IDF 
COMMON  CHS ( 4  00 )  .CHI  SUM,  I OBFR ( 4 00  >  ,  CMPFR  (4  00 ) 

COMMON  I  NTOV.FNOBS  ,  INTER  (  1  00  )  .FGRAPH  (65  )  ♦  STEP  <400  )  ♦  tape  «  I  C-0 
COMMON  NO JOBS. form (10) .POOL  I D (  10)  , OBSERP  < 14000)  ,  I NTERP (  100) 
COMMON  NOBS  A  (500  ).  I  NTOVD.NO  IMP,  GROUP.  I  NTERV  »  I  PR  I  NT  ,  Q 

DIMENSION  T DENT ( 6 ) , T ADD (  1  1  )  ,  TADD2  <  1  1  ) 

DIMENSION  I  DENT (10) 

EQUIVALENCE  (BLANkr ,  TA0D2  (  1  )  )  ,  <  XXX  .T  ADD  (  1  )  ) 

REAL  MEAN, MAX, MIN, MAXP.MJNP,  I  FREQ,  I OBFR 
INTEGER  TAPE.GROJP.Q 
DATA  TDENT < 1  )  (8HTRANSE0R ) 

DATA  TDENT (2  )  (8HMATI0N  X) 

DATA  TDENT  <3  )  (0H  TO  ) 

DATA (T ADD (  I  )  ,  I « 1  , 1  1  )  (PHX, 

1 HLN ( 1+LN ( «  8HSQRT ( X ) . ,BH1 /x. 

2HARCS I N (  ) 

DATA (TADD2 (  I  )  ,  I  *  1  »  1 1  )  ( 8H 
18H1 +X ) ) ,  ,8H  . PH 

28HSQRT (X  )  )  ) 

DATA  PLUSfRH,  ) 

IF (  IGO- 1 0 ) 1 03. 1 04 , 1 03 

104  TDENT ( 6 ) =BOOL ( TADD2 ( 3 ) ) 

GC  TO  1 

103  IF  (  IGO- 1  1  )  1 05, 1 06,  1 C5 

106  TDENT (6 ) =BOOL (PLUS ) 

GO  TO  107 

105  TDENT (6 ) =BOOL (TADD2 (  1)  ) 

107  TDENT ( 4 ) =  BOOL ( T ADD ( I  GO )  ) 

TDEN "(5 ) =BOOL  (TADT2 ( I  GO )  ) 

I GG*  1 

REWIND  10 
PEW  I  NO  |p 

nog  »no  ,nR  c 

cUMpP -  . 

f  ./PRC  p  .  n 

I Df  5  -  > 

loop  no  1^"1  Jjy-I  ,NJOr' 

GO  ’■"I  o*  I  ,-DOur 

20  READ  (  1  "  '  NPH'-  ,  I  N’  OY.NO  I  NT  ,  ,  1  NT  f  »  (  I  1  ,1  =  1  ,  NO  I  NT  ) 

1  format  (|M=i 

DFri’icr  ,  ;  ^|  |  r  ■  -r 
1  on  FORMAT  (  \  '  A"'  v 

re  a r  ;  ~  *  o r  ,  -“p\  .  •  •  .  ;  =  ;  ,  •  jopc  j 


,8HLN  X.  »  8 HLN (LN( X  )  ,BHLN (  1  fX  ) • *  8 
.  8H 1 / ( 1 +X ) . , 8HARCS I N(X.BH2*APCSIN,8 

,SH  .BH),  »  8  H  , 

, 0H  ,8H).  , 8H ( SORT ( X ) « 


7 

S 

9 

10 

1  I 

?50 

751 

poo 


C  COMPUTE  standard  ofviamon  or  the  observations 

NUI NT  =  NO I  NT 

FN05S=N0BS 

SUM  =  0  «  0 

SUM2=0.0 

DO  3  1=1 .NOBS 

ORSER V  (  I  )  =TEUNC  f  OBSERV  ID) 

3  SUM  =  SUM  +08SERV(1) 

__  MEAN  =  SUM/ENOBS 

'  DO  4  1  =  1  .NOBS 

OBSERP (  I  5  *OBSERV (  I  1 -MEAN 

4  SUM?*SUM2+08SERP ( 1 ) *OBSEQP ( I ) 

SDEV  =  SORT (SUM2/ (FNOBS-1  •  )  ) 

CALL  MAXM!N(0BSERV.N0BS«MAX»M1N) 

GO  T0(202. 203. 203, 220 ) .GROUP 

203  IF ( JJX.GT. 1 )GO  TO  208 
MAXP=MAX-MFAN 

M I NP=M I N-MEAN 
GO  T  1  209 

208  MAXP=AMAX1 (MAXP.MAX-MEAN ) 

M I NP : AM INJ  < M J NP , M I N-ME A N ) 

?0Q  SUM?P=SUM?P+SUM2 

enobsp=enobsp+fnobs 

WR I TE (  1 2 )  ( OBSERP (  I  ) ,  I  =  1  » NOBS  1 
GO  T0(220, 202, 1001 .220) .GROUP 

204  MAx=MAxP 

I  N  =  m  I  NP 
ME AN  =  0 • 0 

SDEV=SQRT ( SUM2P/ (FNOBSP-FLOAT (NO JOBS ) > ) 

202  RANGE=MAX-M I N 

DO  1 000  1=1 ,NU INT 

IF (GROUP. LT. 4 >G0  TO  198 
I NTERV= INTERPI I ) 

GO  TO  190 

1  98  I NTER V= INTER (  I  ) 

199  FNTC’DV=  I  NTFRV 

DEL  T  A  =  Q  A  NGF  /FNTFRV 
STEP  M  >  =M I N  +  DELT A 

IFREO (1  )  =0.C 
IFREOf INTERV+1 )=0.0 
~0  5  J  =  2 ,  I N  T  E  p  V 
I F RE  O ( J ) =0. 0 

5  STEP < J ) =STEP ( J-l  l+DELTA 
STEP ( I NTERV ) =MAX 

GO  TO (205,205.220.206) .GROUP 
2^5  DO  6201  L  = 1  .  1 NTEPV 
6?0l  ST’OfL  )  =  FTlrD<L  )-mEAN 
DO  6  <= 1 . NOBS 
JJ« (OBSFRV (< ) -m IN ) /DELTA 

6  I^oSOt  JJ+1  )--IFDEQ(JJ+l  1  +  1.0 

ifp'ci i \tfdv  i  =  i treo ( intervi  +  ifreo( i nterv+  l  ) 

0  =  0  *•  1 

PP  I  NT  1  T  ,  |  PENT 

PRINT  ,  o  ,  N0QS  NTERV  ,  TDENT 

T  TCRVA'  OUARE  TERT,[A,BH  WITh.I6.17H  OBSERVATIONS  AND,  15,1 

I  1  M  rT  CVM 

C  Al.L  *  pc  N  (  RT  !  p,  I  TFF  ,FN0°S  ) 

I  p  t  i  - :  pdoo.t’ 

99  r  ALL  PR  IN'-;  (  J  J  x  , 

P  DC  i  t  #  »  per  nj  y 
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1 3  F OPM AT ( 1 H 1 «  J0A8) 

PRINT  7,0,  NOBS  « I NTERV  *  TDENT 
GO  TO  97 

206  PRINT  13* POOL  I D 
0«0+l 

PRINT  77  «Q  «  FNO0SP » I NTERV  «  TDEN " 

77  FORMAT ( 16H0CH I  SQUARE  TEST.I4.5H  W l TM. F8 • 0 » 24H  POOLED  0E1  .vRVAT  I  ONS 
1  AND.I5.liH  INTERVALS. «2An.A4,3Ae) 

REWIND  12 

DO  368  K*  1  » NO JOBS 

NOBS*NOBSA  (K) 

READ (12) { OBSERV (L ) »  L* 1 .NOBS) 

GO  TO  (444.445) . IPRINT 

444  IF ( I.EQ.l • AND. I GG • EQ * 2 ) CALL  PRINT!  CIC) 

445  DO  66  IFG*1.N0BS 

JJ= (OBSERV ( IFGl-MIN ) /DELTA 
66  IFREO ( J J+l  ) ■ I FREQ ( JJ+1  )♦  1  *0 
368  CONTINUE 

I FREQ ( I NTERV ) *  I FREQ { I NTERV )  + 1 FREQ ( INTERV+1  ) 

I F ( I.GT.l  )G0  TO  207 
GO  TO  (442.207) . IPRINT 
442  GO  TO  (207.447) . I GG 
447  PRINT  1 3 .POOL  ID 

PRINT  77 .O.FNOBSP. I NTERV .TDENT 

207  CALL  TRAN ( STEP . I DFF .FNOBSP ) 

97  PRINT  88. INTOV. INTERV 

88  FORMAT (  6 1 H  THE  DEGREES  OF  FREEDOM  WAS  COMPUTED  ON  THE  BASIS  OF  AT 
1  LEAST  1 5 . 40H  OBSERVATIONS  IN  EACH  OF  A  SUBSET  Or  THE.I5.11H  I NTERV 
2ALS. ) 

PRINT  30.MEAN.SDEV 

30  FORM  \T ( 6H0MEAN» .  E14.8.23H  ...STANDARD  DEV  I  AT I 0N» . E 1 4 . 8 ) 

PRINT  8, RANGE, MAX, MIN, DELTA 

8  FORMAT  (8H0RANGE  =,E15.8,12H  MAXIMUM  ■.E15.8.12H  MINIMUM  *,E15, 

18.19H  INTERVAL  V T DTS t  =,“15. 8) 

PRINT  886 

888  FORMAT (42H0  UPPER  BOUND  FREQUENCY  BAR  CHART. 50X.35HCHI  SO 

1U  CONTP  OBS  FP  THEO  FQ ) 

CALL  MAXMIN( I  FREQ .  I NTERV , XAM , SC ALE ) 

IF (XAM.GT.65.0 )G0  TO  91 
SCAlE=1 .0 
GO  TO  92 

pi  sc ale =x am/65 . o 
92  DO  9  K  =  1  . INTERV 

I FG9PH * T Frfo ( K ) /SC ALET • 5 
I F ( I FGRPH  »LE. 0 ) GO  TO  12 

11  DO  10  I FG* 1 , I FGRPH 
10  FGRAPHf IFG)=XXX 

12  IFGPPHr IFGRPH+1 

I F ( IFGRPH-66)2033, 2035, 2035 
2033  DO  2027  I FG= I FGRPH , 65 
2027  FGDAPH ( IFG 1 =8LANK 
2 n  35  IF(CHS (K  )  12040,2041  ,2041 
204C  PRINT  2029,STEP(K ) , IFREO (K ) , F GRAPH 
GO  T0  o 

“PINT  2029, STEP (<) , IFPEQ(K),FGRAPH.CHS(K).I0BFR(K),CMPFR(K) 

2^20  FORMAT  (2X,  Fi  <5.4, 2X.F7.0.4X,  1  HI  ,  65A  1  »  1  X  ,  F8 . 3 , 1  X  ,  F  1  0  .  I  .2X.F10.  3) 

9  CONTINUE 
PRINT  6Q4, SCALE 

6°4  FORMAT ( 1 3H0X  REPRESENTS ,F1 0.4 , |4H  OBSERVATIONS.) 

IF (  IDF  >2~4B. 2048, 2040 
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2048  PRINT  2050 

2050  FORMAT (35HL  CH I  SQUARE  COULD  NOT  BE  COMPUTED*) 

GO  TO  1000 

2049  PRIN*  2039*  CH I  SUM • I DF 

2039  FORMAT (12H0CH I  SQUARE  =«F16«6*5H  WITH.I4.20H  DEGREE  *  OF  FREEDOM,) 

1000  CONTINUE 

GO  T0( 1 001 , t 001 « 1 001 ,221 ) .GROUP 

1001  CONTINUE 

GO  T0(221 *223.222.221) .GROUP 

222  I GG  =  2 

223  GP0UP=4 
lDFF«N0J0BS+2 
NOG*  1 

NU I  NT  *  NO  I NTP 
1NT0V= INTOVP 
GO  TO  1002 

220  STOP 

221  RETURN 
END 

T  subtype.fortran.lmap.prin 

SUBROUTINE  TPAN ( STEP . 1  OFF , OB ) 

C  THIS  SUBROUTINE  FITS  A  NORMAL  CuPVE  WITH  MEAN  ZERO  AND  STANOARD 
C  DEVIATION  SDEV  TO  THE  OATA  IN  IFREQ  WHERE  THE  UPPER  BOuNO  OF  EACH 
C  INTERVAL  IS  IN  THE  CORRESPONDING  ENTRY  IN  STEP(I>*  OB  IS  THE  TOTAL 
C  NUMBFR  OF  OBSERVATIONS, THE  ROUTINE  GROUPS  THE  DATA  SO  THAT  THERE  ARE 
C  AT  LEAST  INTOV  THEORETICAL  VALUES  PER  INTERVAL  AND  THEN  COMPUTES  THE 
"  ctat  l  ST  IC  f  CHI  SUM  )  TO  G 1  VF  AN  ESTIMATION  OF  THE  GOODNESS  OF 

C  FIT. OK  BY  I T  FROM  THE  ROUTINE,  IDF  CONTAINS  THE  NUMBER  OF  DEGREES  OF 
r  coEEPOM.CHI  ( j)-_i  if  THE  JTM  INTERVAL  WAS  NOT  THE  LAST  OF  A  GROt.3. 

C  OTHERWISE  IT  CONTAINS  (OBSERVED  FREQUENCY-THEORETICAL  FREQUENCY ) »*2 
C  DIVIDFD  BY  THEORETICAL  FREQUENCY, 

C  IOBFR(J)  IS  .ON  EXIT. THE  OBSERVED  FREQUENCY, IF  THE  JTH  INTERVAL  WAS 

C  THE  LAST  OF  4  GROUP , OTHERW I SE  ITS  CONTENTS  ARE  MEAN I NGLESS *L I KEW I SE , 

C  CMPFR ( j 1  CONTAINS  the  THEORETICAL  FREQUENCIES* 

COMMON  may ,mj  N.OBSERV ( 14  000 ) . NOBS . ST  I P ( 4Q0 ) » I FREQ (401  ) ,SDEV, IDF 
COMMON  CHI  (400) , CH I  SUM ,  I OBFR ( 400 ) »  CMPFR ( 400  ) 

Common  INTOV.FNOBS. INTER (100) . FGRAPH (65 ) .STAP(400 ) .TAPE. I  GO 
COMMON  NO JOBS .FORM (10) .ROOL ID ( 10) ,0BSERP( 14000 ) » INTERP( 100 ) 

COMMON  NOBS  A (500) «  INTOVP, NOINTP, GROUP. I NTERV « I  PR  I  NT 

DIMENSION  STEP (400) 

PEAL  max .M IN, IFREQ. I OBFR 
FNT0V= INTOV 
F  OT=  ),0 

<oun  r=o 
chi sum=o,c 
PP0B0=0.0 
P0=0.0 

INTM1 = INTFPV-I 

3  00  10  J= I . I NTERV 

IF (J»NE, INTERVIGO  TO  1 
POO*. s 1 ,0 

GO  TO  2 

1  PC06N=FREC ( STEP ( J)/SDEV) 

?  FOC*OB* (PROBN-PROBO ) 

FC  = IFREQ ( J  >+F0 
IF (POC -FNTOv ) 4 , 4 , 5 

4  CHI ( J)=-l ,0 
GO  TO  1 0 

5  POTsFOT+FO 

RE*  A [N=OB*  < 1 , o—PPOBO ) 
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I F ( OB* ( 1 ,0-PROBN)-FNTOV>6.6,9 
6  FO=FO+<OB-FOT) 

FOMRE*FO-  REMAIN 

CH I  ( I NTERV ) «FOMRE  4FOMRE/REMA I N 

I OBFR ( INTERV)*FO 

CMPFP( I NTERV ) =  REMA I N 

CH I SUM=CH I SUM+CH I ( I NTERV) 

<OUNT=KOUNT+l 

JJ*J 

GO  TO  12 
9  FOMFOC=FO-FOC 

CHI ( J)=FOMFOC*FOMFOC/FOC 
CHI SUM=CHISUM+CHI (J) 

I OBFR ( J ) *FO 
CMPFR{ j)=FOC 
KOUN*=KOUNT+l 
FO*0.0 
PROBO*PROBN 
10  CONTINUE 
GO  TO  17 

12  IF(JJ-INTM1  )14,14,17 
14  DO  16  J  =  J  J  *  I  NTM 1 

16  CHI ( J)*-l .0 

17  idf*kount-idff 

RETURN 

END 

T  «UBTVPE. FORTRAN, LMAP«PBIN 

subroutine  MAXMJN(A«K, AMAXfAMIN) 

DIMENSION  A (14000) 

AMAX* A ( 1 ) 

AM IN«A ( 1 ) 

DO  900  I*2,< 

IF ( AMAx-A ( I ) ) 1 ,5,5 
1  AMAX*A(!) 

GO  TO  900 

5  IF ( AMI N-A ( I ) )900, 900,3 
3  AM  IN* A (  I  ) 

900  CONTINUE 
?20  RETfjON 
END 

T  SUBTYPE, FORTRAN, LMAP,PBIN 

SUBROUTINE  PRINTl(K) 

COMMON  MAX, MIN, OBSERV( 1*000 ) , NOBS , ST  I P ( 400 ) , I FREQ (401  ) , SDEV, IDF 
COMMON  CHS ( 400 ) , CH I  SUM , I OBFR ( 400 ) , CMPFR ( 400 ) 

COMMON  I NTOV ,FNOBS , I NTER (100), FGRAPH ( 65 ) « STEP ( 400  > , TAPE , I  GO 
COMMON  NO JOBS .FORM (10) .POOL IC ( 1 0 ) . OBSERP ( 1 4000 ) • I NTERP (100) 

COMMON  NOBSA (500 ) , I NTOV®, NO I NTP, GROUP, I NTERV , IPRINT 
PRINT  91 .< 

9  1  FORMAT (A4HO A  LISTING  OF  THE  OBSERVATIONS  IN  SAMPLE  N0«I4,9H  FOLLOW 
1 S./1H0 ) 

LC*9 

96  DO  9?  LL=l.NOPS»P 
LC*LC*1 
<M=LL+7 

IF  (N  !Sr--<M  )4!  .95.95 
41  xm=nop- 

9*  POINT  °?,LC.  (OBSERVM.  ) ,L*LL.KM) 

93  FORMAT (1H  .I5.3H)  .SE15.8) 

92  CONTINUE 
RETURN 
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END 


T  subtype, fortran, lmap.pb I N 

FUNCT ION  T1 (X ) 

T  1  3  X 

PETUPN 

END 

T  SUBTYPE, FORTRAN, LMAP.PBIN 

FUNCTION  T2  (X  ) 

T2= ALOG (X  ) 

RETURN 

END 

T  subtype, fortran, lmap.pbin 

function  t3(xj 
T3=AL0G ( ALOG (X ) ) 

PFTURN 

END 

T  <5UBTYPE*FORTRAN*LMAP«PBIN 

FUNCTION  T4(X> 

T4= ALOG ( 1 ,0  +  X ) 

RETURN 

END 

t  subtype, fortran, lmap.pbin 

FUNCTION  T 5 ( X ) 

T5*AL0G (  1 .0  +  ALOG  < 1  • 0  +  X )  > 

RETURN 

END 

T  SUBTYPE, FORTRAN, L^AP.PBIN 

FUNC-ION  T6  <  X ) 

T6=«URT (X ) 

RETURN 

END 

T  SUBTYPE. FORTRAN, LMAP.PBIN 

FUNCTION  T7(X) 

T7= 1 . 0/X 

RETURN 

END 

T  subtype, fortran, lmap.pbin 

FUNCTION  TB ( X ) 

T8= 1 • 0/ ; 1 ,0  +  X ) 

RETURN 

END 

T  SUBTYPE, FORTRAN, LMAP.PBIN 

FUNCTION  T9 ( X ) 

TP=4T4N (X/EOOT ( 1 . 0-X*X ) ) 

RETURN 

END 

T  SU8TYPE, FORTRAN, LMAP,PBIN 

FUNCTION  TIO(X) 

T 1 0*2,0 ♦AT  AN ( FORT  ( X  )  /CQR  T  (  1  *  C  —  X )  ) 

RETURN 

END 

T  SUBT  voo  ,  FORTRAN  ,  (.  VAP  ,  P0  I  N 

FUNC  T  I  ON  Till'/) 

T I  |  j4T4\ f  FORT (X ) /SORT (1  ,0-X)  ) 

RETURN 

END 
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